Global Nash Convergence of Foster and Young's Regret Testing

نویسندگان
چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Global Nash convergence of Foster and Young's regret testing

We construct an uncoupled randomized strategy of repeated play such that, if every player plays according to it, mixed action profiles converge almost surely to a Nash equilibrium of the stage game. The strategy requires very little in terms of information about the game, as players’ actions are based only on their own past payoffs. Moreover, in a variant of the procedure, players need not know...

متن کامل

Regret testing: learning to play Nash equilibrium without knowing you have an opponent

A learning rule is uncoupled if a player does not condition his strategy on the opponent’s payoffs. It is radically uncoupled if a player does not condition his strategy on the opponent’s actions or payoffs. We demonstrate a family of simple, radically uncoupled learning rules whose period-by-period behavior comes arbitrarily close to Nash equilibrium behavior in any finite two-person game.

متن کامل

Regret Testing: A Simple Payoff-Based Procedure for Learning Nash Equilibrium∗

A learning rule is uncoupled if a player does not condition his strategy on the opponent’s payoffs. It is radically uncoupled if the player does not condition his strategy on the opponent’s actions or payoffs. We demonstrate a simple class of radically uncoupled learning rules, patterned after aspiration learning models, whose period-byperiod behavior comes arbitrarily close to Nash equilibrium...

متن کامل

Regret Testing: A Simple Payo¤-Based Procedure for Learning Nash Equilibrium1

A learning rule is uncoupled if a player does not condition his strategy on the opponent’s payo¤s. It is radically uncoupled if a player does not condition his strategy on the opponent’s actions or payo¤s. We demonstrate a family of simple, radically uncoupled learning rules whose period-by-period behavior comes arbitrarily close to Nash equilibrium behavior in any …nite two-person game. Keywor...

متن کامل

Regret-Minimizers and Convergence to Price-Taking

This paper studies a variety of forms of regret minimization as the criteria with which traders choose their bids/asks in a double auction. Unlike the expected utility maximizers that populate typical market models, these traders do not determine their actions using a single prior. The analysis proves that minimax regret traders will not converge to price-taking as the number of traders in the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: SSRN Electronic Journal

سال: 2004

ISSN: 1556-5068

DOI: 10.2139/ssrn.678622